Factors affecting speech retrieval
نویسندگان
چکیده
Collections of speech documents can be searched using speech retrieval, in which the documents are processed by a speech recogniser to give text that can be searched by standard text retrieval techniques. Recognition is the translation of speech signals into either words or subword units such as phonemes. We investigated the use of a phoneme-based recogniser to obtain phoneme sequences. We found that phoneme recognition is worse than word recognition, because of lack of context and di culty in phoneme boundary detection. Comparing the transcriptions of two di erent phonemebased recogniser, we found that the e ects of training using well-de ned phoneme data, the lack of a language model, and lack of a context-dependent model a ected recognition performance. Retrieval was based on n-grams. We found that trigrams performed better than quadgrams because the longer n-gram features contained too many transcription errors. Comparing the phonetic transcriptions from a word recogniser to transcriptions from a phoneme recogniser, we found that using 61 phones modelled with an algorithmic approach were better than using 40 phones modelled with a dictionary approach.
منابع مشابه
Factors Affecting Student's Scientific Information Retrieval based on Fuzzy Logic Method Compared to Traditional Method
Background and aim: The aim of this study was to identify the factors affecting on students' performance in information retrieval based on fuzzy logic method compared to traditional method. Materials and methods: This survey-descriptive study was performed using quantitative approach. The research population was 34 PhD students, and the researcher-made questionnaire was used. Data were analyzed...
متن کاملFast vector quantization based on subcodebook selection and its application to speech recognition
Vector quantization (VQ) is a efficient technique for data compression with a minimum distortion. VQ is widely used in applications as speech and image coding, speech recognition, and image retrieval. This paper presents a novel fast nearestneighbor algorithm and shows its application to speech recognition. The proposed algorithm is based on a fast preselection that reduces the search to a limi...
متن کاملFactors affecting choice of speech over keyboard and mouse in a simple data-retrieval task
This paper describes some recent experiments that assess user mode selection behavior in amulti-modal environment inwhich actions can be performed with equivalent effect by speech, keyboard or scroller. Results indicate that users freely choose speech over other modalities, even when it is less efcient in objective terms, such as time-to-completion or input error. Additional evidence indicates ...
متن کاملNearest-neighbor search algorithms based on subcodebook selection and its application to speech recognition
Vector quantization (VQ) is a efficient technique for data compression with a minimum distortion. VQ is widely used in applications as speech and image coding, speech recognition, and image retrieval. This paper presents a novel fast nearestneighbor algorithm and shows its application to speech recognition. The proposed algorithm is based on a fast preselection that reduces the search to a limi...
متن کاملInvestigating Factors Affecting the Quality of Virtual Education from the point of Professors and Students of Rehabilitation Fields of Ahvaz Jundishapur University of Medical Sciences during COVID-19
Introduction: The closing down of schools and universities due to the COVID-19 outbreak led to introducing virtual education. Considering the importance of virtual education during coronavirus pandemic, this study aimed to investigate the factors affecting the quality of virtual education from the viewpoint of professors and students of rehabilitation fields of Ahvaz Jundishapur University of M...
متن کامل